Predicting the Gender of Indonesian Names

نویسنده

  • Ali Akbar Septiandri
چکیده

We present a new way to predict genders from names using character-level LongShort Term Memory (char-LSTM). We compared our method with some conventional machine learning methods, namely Näıve Bayes, logistic regression, and XGBoost with n-grams as the features. We evaluated the models on a dataset consisting of the names of Indonesian people. The results show that we can achieve 92.25% accuracy by using this approach.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Gender Issue in Masculine Sports in Indonesia: A Case Study

Background. Womenchr('39')s participation in sports has been improved in terms of number and types of sport. This improvement is realized due to culture change as an impact of direct dissemination of family of athletes. During the past 15 years, the change seems to be evident where on the 2005 Sea Games, Indonesian female athletes had the opportunity to participate in the event, even though the...

متن کامل

Conceptual Representation Changes in Indonesian-English Bilinguals.

This study investigated conceptual representations changes in bilinguals. Participants were Indonesian-English bilinguals (dominant in Indonesian, with different levels of English proficiency) and a control group composed of English-dominant bilinguals. All completed a gender decision task, in which participants decided whether English words referred to a male or female person or animal. In ord...

متن کامل

Truncation in Indonesian: Evidence for Violable Minimal Words and ANCHORRIGHT

Indonesian exhibits parallel patterns of truncation in the short forms of terms of address and the short forms of personal names. As illustrated in (1a), many common terms of address are formed from the base of a kinship term and appear in both a "long" and "short" form. It is the short forms that are understood here to be truncated. For personal names, a similar pattern is seen where a vocativ...

متن کامل

Affective Meaning, Concreteness, and Subjective Frequency Norms for Indonesian Words

This study investigated the lexical-semantic space organized by the semantic and affective features of Indonesian words and their relationship with gender and cultural aspects. We recruited 1,402 participants who were native speakers of Indonesian to rate affective and lexico-semantic properties of 1,490 Indonesian words. Valence, Arousal, Dominance, Predictability, Subjective Frequency, and Co...

متن کامل

Naturalization in Translation:A Case Study on the Translation of English-Indonesian Medical Terms

Naturalization is a translation procedure that is predominantly utilized in the translation of English medical terms into Indonesian. This study focuses on identifying types of naturalization involving the adjustment of spelling and pronunciation and investigating whether naturalization has been appropriately applied based on the rules in the Indonesian general guidance of term formation. The d...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1707.07129  شماره 

صفحات  -

تاریخ انتشار 2017